Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 22191 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.9 MiB |
| Average record size in memory | 88.0 B |
Variable types
| Numeric | 11 |
|---|
AT is highly correlated with AH and 1 other fields | High correlation |
AH is highly correlated with AT | High correlation |
AFDP is highly correlated with GTEP and 3 other fields | High correlation |
GTEP is highly correlated with AFDP and 4 other fields | High correlation |
TIT is highly correlated with AFDP and 4 other fields | High correlation |
TEY is highly correlated with AFDP and 4 other fields | High correlation |
CDP is highly correlated with AFDP and 4 other fields | High correlation |
CO is highly correlated with GTEP and 3 other fields | High correlation |
NOX is highly correlated with AT | High correlation |
AT is highly correlated with AH and 1 other fields | High correlation |
AH is highly correlated with AT | High correlation |
AFDP is highly correlated with GTEP and 3 other fields | High correlation |
GTEP is highly correlated with AFDP and 5 other fields | High correlation |
TIT is highly correlated with AFDP and 4 other fields | High correlation |
TAT is highly correlated with GTEP and 2 other fields | High correlation |
TEY is highly correlated with AFDP and 5 other fields | High correlation |
CDP is highly correlated with AFDP and 5 other fields | High correlation |
CO is highly correlated with GTEP and 3 other fields | High correlation |
NOX is highly correlated with AT | High correlation |
AFDP is highly correlated with GTEP and 1 other fields | High correlation |
GTEP is highly correlated with AFDP and 4 other fields | High correlation |
TIT is highly correlated with AFDP and 4 other fields | High correlation |
TEY is highly correlated with GTEP and 3 other fields | High correlation |
CDP is highly correlated with GTEP and 3 other fields | High correlation |
CO is highly correlated with GTEP and 3 other fields | High correlation |
AT is highly correlated with AP and 7 other fields | High correlation |
AP is highly correlated with AT | High correlation |
AH is highly correlated with AT | High correlation |
AFDP is highly correlated with GTEP and 5 other fields | High correlation |
GTEP is highly correlated with AT and 7 other fields | High correlation |
TIT is highly correlated with AT and 7 other fields | High correlation |
TAT is highly correlated with AT and 5 other fields | High correlation |
TEY is highly correlated with AT and 7 other fields | High correlation |
CDP is highly correlated with AT and 7 other fields | High correlation |
CO is highly correlated with AFDP and 5 other fields | High correlation |
NOX is highly correlated with AT and 5 other fields | High correlation |
Reproduction
| Analysis started | 2022-07-01 15:14:45.627541 |
|---|---|
| Analysis finished | 2022-07-01 15:15:18.930121 |
| Duration | 33.3 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 15988 |
|---|---|
| Distinct (%) | 72.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.71224675 |
| Minimum | 0.28985 |
|---|---|
| Maximum | 34.929 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 173.5 KiB |
Quantile statistics
| Minimum | 0.28985 |
|---|---|
| 5-th percentile | 5.99275 |
| Q1 | 11.6645 |
| median | 17.739 |
| Q3 | 23.657 |
| 95-th percentile | 29.399 |
| Maximum | 34.929 |
| Range | 34.63915 |
| Interquartile range (IQR) | 11.9925 |
Descriptive statistics
| Standard deviation | 7.352788794 |
|---|---|
| Coefficient of variation (CV) | 0.4151245688 |
| Kurtosis | -0.9509019253 |
| Mean | 17.71224675 |
| Median Absolute Deviation (MAD) | 6.006 |
| Skewness | 0.008794502091 |
| Sum | 393052.4676 |
| Variance | 54.06350305 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 11.92 | 6 | < 0.1% |
| 10.992 | 6 | < 0.1% |
| 23.969 | 6 | < 0.1% |
| 12.068 | 6 | < 0.1% |
| 25.597 | 6 | < 0.1% |
| 11.288 | 6 | < 0.1% |
| 18.091 | 6 | < 0.1% |
| 24.247 | 5 | < 0.1% |
| 14.662 | 5 | < 0.1% |
| 17.203 | 5 | < 0.1% |
| Other values (15978) | 22134 |
| Value | Count | Frequency (%) |
| 0.28985 | 1 | |
| 0.38289 | 1 | |
| 0.5037 | 1 | |
| 0.5223 | 1 | |
| 0.58759 | 1 | |
| 0.60394 | 1 | |
| 0.76744 | 1 | |
| 0.78907 | 1 | |
| 0.83101 | 1 | |
| 0.86433 | 1 |
| Value | Count | Frequency (%) |
| 34.929 | 1 | |
| 34.903 | 1 | |
| 34.831 | 1 | |
| 34.748 | 1 | |
| 34.665 | 1 | |
| 34.619 | 1 | |
| 34.598 | 1 | |
| 34.532 | 1 | |
| 34.491 | 1 | |
| 34.483 | 1 |
| Distinct | 670 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1012.812607 |
| Minimum | 985.85 |
|---|---|
| Maximum | 1034.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 173.5 KiB |
Quantile statistics
| Minimum | 985.85 |
|---|---|
| 5-th percentile | 1002.8 |
| Q1 | 1008.8 |
| median | 1012.4 |
| Q3 | 1016.7 |
| 95-th percentile | 1024 |
| Maximum | 1034.2 |
| Range | 48.35 |
| Interquartile range (IQR) | 7.9 |
Descriptive statistics
| Standard deviation | 6.396587838 |
|---|---|
| Coefficient of variation (CV) | 0.006315667671 |
| Kurtosis | 0.3759747746 |
| Mean | 1012.812607 |
| Median Absolute Deviation (MAD) | 3.9 |
| Skewness | 0.06007877594 |
| Sum | 22475324.56 |
| Variance | 40.91633597 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1012.1 | 191 | 0.9% |
| 1010.8 | 188 | 0.8% |
| 1011.8 | 188 | 0.8% |
| 1011.9 | 177 | 0.8% |
| 1012 | 177 | 0.8% |
| 1011.4 | 173 | 0.8% |
| 1011.1 | 172 | 0.8% |
| 1012.4 | 170 | 0.8% |
| 1013.6 | 169 | 0.8% |
| 1010.9 | 168 | 0.8% |
| Other values (660) | 20418 |
| Value | Count | Frequency (%) |
| 985.85 | 1 | |
| 986.16 | 1 | |
| 986.25 | 1 | |
| 986.41 | 2 | |
| 986.43 | 1 | |
| 986.56 | 1 | |
| 986.78 | 1 | |
| 986.87 | 1 | |
| 987.31 | 1 | |
| 987.43 | 1 |
| Value | Count | Frequency (%) |
| 1034.2 | 1 | < 0.1% |
| 1034 | 1 | < 0.1% |
| 1033.9 | 1 | < 0.1% |
| 1033.4 | 2 | < 0.1% |
| 1033.2 | 1 | < 0.1% |
| 1033 | 6 | |
| 1032.8 | 2 | < 0.1% |
| 1032.6 | 1 | < 0.1% |
| 1032.4 | 1 | < 0.1% |
| 1032.3 | 1 | < 0.1% |
| Distinct | 17316 |
|---|---|
| Distinct (%) | 78.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79.55522437 |
| Minimum | 27.504 |
|---|---|
| Maximum | 100.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 173.5 KiB |
Quantile statistics
| Minimum | 27.504 |
|---|---|
| 5-th percentile | 52.9385 |
| Q1 | 70.2945 |
| median | 82.781 |
| Q3 | 90.532 |
| 95-th percentile | 97.5305 |
| Maximum | 100.2 |
| Range | 72.696 |
| Interquartile range (IQR) | 20.2375 |
Descriptive statistics
| Standard deviation | 13.91501847 |
|---|---|
| Coefficient of variation (CV) | 0.1749101782 |
| Kurtosis | -0.2243244766 |
| Mean | 79.55522437 |
| Median Absolute Deviation (MAD) | 9.252 |
| Skewness | -0.7179519553 |
| Sum | 1765409.984 |
| Variance | 193.6277391 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 100.14 | 46 | 0.2% |
| 100.12 | 46 | 0.2% |
| 100.15 | 42 | 0.2% |
| 100.16 | 42 | 0.2% |
| 100.11 | 38 | 0.2% |
| 100.13 | 33 | 0.1% |
| 100.17 | 27 | 0.1% |
| 100.09 | 22 | 0.1% |
| 100.1 | 20 | 0.1% |
| 100.06 | 13 | 0.1% |
| Other values (17306) | 21862 |
| Value | Count | Frequency (%) |
| 27.504 | 1 | |
| 30.344 | 1 | |
| 30.899 | 1 | |
| 31.204 | 1 | |
| 31.964 | 1 | |
| 32.617 | 1 | |
| 32.789 | 1 | |
| 32.792 | 1 | |
| 33.023 | 1 | |
| 33.264 | 1 |
| Value | Count | Frequency (%) |
| 100.2 | 4 | < 0.1% |
| 100.19 | 1 | < 0.1% |
| 100.18 | 5 | < 0.1% |
| 100.17 | 27 | |
| 100.16 | 42 | |
| 100.15 | 42 | |
| 100.14 | 46 | |
| 100.13 | 33 | |
| 100.12 | 46 | |
| 100.11 | 38 |
| Distinct | 15228 |
|---|---|
| Distinct (%) | 68.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.037750016 |
| Minimum | 2.0874 |
|---|---|
| Maximum | 7.6106 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 173.5 KiB |
Quantile statistics
| Minimum | 2.0874 |
|---|---|
| 5-th percentile | 2.74735 |
| Q1 | 3.44985 |
| median | 4.0688 |
| Q3 | 4.4514 |
| 95-th percentile | 5.54575 |
| Maximum | 7.6106 |
| Range | 5.5232 |
| Interquartile range (IQR) | 1.00155 |
Descriptive statistics
| Standard deviation | 0.8102228958 |
|---|---|
| Coefficient of variation (CV) | 0.2006619758 |
| Kurtosis | 0.168933672 |
| Mean | 4.037750016 |
| Median Absolute Deviation (MAD) | 0.4792 |
| Skewness | 0.3755179931 |
| Sum | 89601.7106 |
| Variance | 0.6564611409 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 4.1286 | 7 | < 0.1% |
| 4.5032 | 7 | < 0.1% |
| 4.0934 | 6 | < 0.1% |
| 4.2322 | 6 | < 0.1% |
| 4.1024 | 6 | < 0.1% |
| 4.1256 | 6 | < 0.1% |
| 4.4361 | 6 | < 0.1% |
| 4.4816 | 6 | < 0.1% |
| 4.4273 | 6 | < 0.1% |
| 3.8837 | 6 | < 0.1% |
| Other values (15218) | 22129 |
| Value | Count | Frequency (%) |
| 2.0874 | 1 | |
| 2.0992 | 1 | |
| 2.1057 | 1 | |
| 2.1197 | 1 | |
| 2.1395 | 1 | |
| 2.1441 | 1 | |
| 2.1597 | 1 | |
| 2.1673 | 1 | |
| 2.185 | 1 | |
| 2.1866 | 1 |
| Value | Count | Frequency (%) |
| 7.6106 | 1 | |
| 7.5549 | 1 | |
| 7.3189 | 1 | |
| 7.2399 | 1 | |
| 6.9831 | 1 | |
| 6.9779 | 1 | |
| 6.956 | 1 | |
| 6.9312 | 1 | |
| 6.927 | 1 | |
| 6.9259 | 1 |
| Distinct | 9827 |
|---|---|
| Distinct (%) | 44.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.31787252 |
| Minimum | 17.878 |
|---|---|
| Maximum | 37.402 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 173.5 KiB |
Quantile statistics
| Minimum | 17.878 |
|---|---|
| 5-th percentile | 19.239 |
| Q1 | 22.736 |
| median | 24.989 |
| Q3 | 26.839 |
| 95-th percentile | 32.898 |
| Maximum | 37.402 |
| Range | 19.524 |
| Interquartile range (IQR) | 4.103 |
Descriptive statistics
| Standard deviation | 4.234147408 |
|---|---|
| Coefficient of variation (CV) | 0.1672394632 |
| Kurtosis | -0.6488366776 |
| Mean | 25.31787252 |
| Median Absolute Deviation (MAD) | 2.049 |
| Skewness | 0.3899686595 |
| Sum | 561828.909 |
| Variance | 17.92800428 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 24.308 | 13 | 0.1% |
| 25.269 | 12 | 0.1% |
| 25.226 | 12 | 0.1% |
| 25.006 | 11 | < 0.1% |
| 24.263 | 11 | < 0.1% |
| 20.296 | 10 | < 0.1% |
| 25.552 | 10 | < 0.1% |
| 24.391 | 10 | < 0.1% |
| 25.487 | 10 | < 0.1% |
| 23.893 | 10 | < 0.1% |
| Other values (9817) | 22082 |
| Value | Count | Frequency (%) |
| 17.878 | 1 | |
| 17.912 | 1 | |
| 17.939 | 1 | |
| 17.966 | 1 | |
| 17.974 | 1 | |
| 18.028 | 1 | |
| 18.037 | 1 | |
| 18.039 | 1 | |
| 18.065 | 1 | |
| 18.079 | 1 |
| Value | Count | Frequency (%) |
| 37.402 | 1 | |
| 37.34 | 1 | |
| 37.189 | 1 | |
| 37.172 | 1 | |
| 37.068 | 1 | |
| 36.973 | 1 | |
| 36.959 | 1 | |
| 36.95 | 1 | |
| 36.917 | 1 | |
| 36.844 | 1 |
| Distinct | 722 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1083.08028 |
| Minimum | 1000.8 |
|---|---|
| Maximum | 1100.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 173.5 KiB |
Quantile statistics
| Minimum | 1000.8 |
|---|---|
| 5-th percentile | 1052.1 |
| Q1 | 1074.6 |
| median | 1088.1 |
| Q3 | 1095.3 |
| 95-th percentile | 1100.1 |
| Maximum | 1100.8 |
| Range | 100 |
| Interquartile range (IQR) | 20.7 |
Descriptive statistics
| Standard deviation | 16.84076501 |
|---|---|
| Coefficient of variation (CV) | 0.01554895358 |
| Kurtosis | 0.0004342854174 |
| Mean | 1083.08028 |
| Median Absolute Deviation (MAD) | 9.1 |
| Skewness | -1.025935614 |
| Sum | 24034634.5 |
| Variance | 283.611366 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1100 | 1348 | 6.1% |
| 1099.9 | 1059 | 4.8% |
| 1100.1 | 816 | 3.7% |
| 1099.8 | 493 | 2.2% |
| 1100.2 | 447 | 2.0% |
| 1100.3 | 249 | 1.1% |
| 1099.7 | 221 | 1.0% |
| 1099.6 | 121 | 0.5% |
| 1085.5 | 119 | 0.5% |
| 1090 | 116 | 0.5% |
| Other values (712) | 17202 |
| Value | Count | Frequency (%) |
| 1000.8 | 1 | |
| 1001.3 | 1 | |
| 1001.4 | 2 | |
| 1009.5 | 1 | |
| 1018.3 | 1 | |
| 1022.1 | 1 | |
| 1023.9 | 1 | |
| 1024.4 | 1 | |
| 1024.5 | 1 | |
| 1024.6 | 1 |
| Value | Count | Frequency (%) |
| 1100.8 | 1 | < 0.1% |
| 1100.6 | 3 | < 0.1% |
| 1100.5 | 15 | 0.1% |
| 1100.4 | 85 | 0.4% |
| 1100.3 | 249 | 1.1% |
| 1100.2 | 447 | 2.0% |
| 1100.1 | 816 | |
| 1100 | 1348 | |
| 1099.9 | 1059 | |
| 1099.8 | 493 | 2.2% |
| Distinct | 2587 |
|---|---|
| Distinct (%) | 11.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 545.5201699 |
| Minimum | 512.45 |
|---|---|
| Maximum | 550.61 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 173.5 KiB |
Quantile statistics
| Minimum | 512.45 |
|---|---|
| 5-th percentile | 528.35 |
| Q1 | 542.6 |
| median | 549.9 |
| Q3 | 550.05 |
| 95-th percentile | 550.28 |
| Maximum | 550.61 |
| Range | 38.16 |
| Interquartile range (IQR) | 7.45 |
Descriptive statistics
| Standard deviation | 7.708707885 |
|---|---|
| Coefficient of variation (CV) | 0.01413093101 |
| Kurtosis | 0.8743698528 |
| Mean | 545.5201699 |
| Median Absolute Deviation (MAD) | 0.24 |
| Skewness | -1.498289663 |
| Sum | 12105638.09 |
| Variance | 59.42417725 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 550 | 436 | 2.0% |
| 550.01 | 425 | 1.9% |
| 549.98 | 416 | 1.9% |
| 549.99 | 409 | 1.8% |
| 549.97 | 403 | 1.8% |
| 549.96 | 402 | 1.8% |
| 550.03 | 396 | 1.8% |
| 550.04 | 385 | 1.7% |
| 550.02 | 380 | 1.7% |
| 549.95 | 369 | 1.7% |
| Other values (2577) | 18170 |
| Value | Count | Frequency (%) |
| 512.45 | 1 | |
| 512.6 | 2 | |
| 513.06 | 1 | |
| 513.09 | 1 | |
| 513.17 | 1 | |
| 513.29 | 1 | |
| 513.47 | 1 | |
| 513.75 | 1 | |
| 514.3 | 1 | |
| 514.43 | 1 |
| Value | Count | Frequency (%) |
| 550.61 | 1 | < 0.1% |
| 550.57 | 1 | < 0.1% |
| 550.56 | 1 | < 0.1% |
| 550.53 | 2 | < 0.1% |
| 550.52 | 1 | < 0.1% |
| 550.51 | 1 | < 0.1% |
| 550.5 | 1 | < 0.1% |
| 550.49 | 2 | < 0.1% |
| 550.48 | 5 | |
| 550.47 | 1 | < 0.1% |
| Distinct | 5013 |
|---|---|
| Distinct (%) | 22.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 133.5373931 |
| Minimum | 100.17 |
|---|---|
| Maximum | 174.61 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 173.5 KiB |
Quantile statistics
| Minimum | 100.17 |
|---|---|
| 5-th percentile | 109.23 |
| Q1 | 124.26 |
| median | 133.77 |
| Q3 | 138.645 |
| 95-th percentile | 161.935 |
| Maximum | 174.61 |
| Range | 74.44 |
| Interquartile range (IQR) | 14.385 |
Descriptive statistics
| Standard deviation | 16.02610712 |
|---|---|
| Coefficient of variation (CV) | 0.1200121311 |
| Kurtosis | -0.5534286352 |
| Mean | 133.5373931 |
| Median Absolute Deviation (MAD) | 7.74 |
| Skewness | 0.1453197274 |
| Sum | 2963328.29 |
| Variance | 256.8361095 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 133.78 | 173 | 0.8% |
| 133.74 | 167 | 0.8% |
| 133.67 | 154 | 0.7% |
| 133.76 | 154 | 0.7% |
| 133.79 | 143 | 0.6% |
| 133.75 | 132 | 0.6% |
| 133.73 | 131 | 0.6% |
| 133.72 | 130 | 0.6% |
| 133.68 | 127 | 0.6% |
| 133.77 | 126 | 0.6% |
| Other values (5003) | 20754 |
| Value | Count | Frequency (%) |
| 100.17 | 1 | |
| 100.32 | 1 | |
| 100.52 | 1 | |
| 100.83 | 1 | |
| 100.96 | 1 | |
| 101.15 | 1 | |
| 101.48 | 1 | |
| 101.62 | 1 | |
| 101.66 | 1 | |
| 101.71 | 1 |
| Value | Count | Frequency (%) |
| 174.61 | 1 | |
| 174.4 | 1 | |
| 174.25 | 1 | |
| 173.92 | 1 | |
| 173.43 | 1 | |
| 173.26 | 1 | |
| 172.97 | 1 | |
| 172.96 | 1 | |
| 172.54 | 2 | |
| 172.15 | 1 |
| Distinct | 3977 |
|---|---|
| Distinct (%) | 17.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.06020766 |
| Minimum | 9.8754 |
|---|---|
| Maximum | 15.081 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 173.5 KiB |
Quantile statistics
| Minimum | 9.8754 |
|---|---|
| 5-th percentile | 10.405 |
| Q1 | 11.395 |
| median | 12.001 |
| Q3 | 12.4435 |
| 95-th percentile | 14.035 |
| Maximum | 15.081 |
| Range | 5.2056 |
| Interquartile range (IQR) | 1.0485 |
Descriptive statistics
| Standard deviation | 1.114264876 |
|---|---|
| Coefficient of variation (CV) | 0.09239184829 |
| Kurtosis | -0.6361301072 |
| Mean | 12.06020766 |
| Median Absolute Deviation (MAD) | 0.532 |
| Skewness | 0.2693521219 |
| Sum | 267628.0682 |
| Variance | 1.241586215 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 12.122 | 32 | 0.1% |
| 12.165 | 31 | 0.1% |
| 11.908 | 29 | 0.1% |
| 11.795 | 28 | 0.1% |
| 11.891 | 28 | 0.1% |
| 11.934 | 28 | 0.1% |
| 11.83 | 28 | 0.1% |
| 12.048 | 28 | 0.1% |
| 12.02 | 27 | 0.1% |
| 11.839 | 27 | 0.1% |
| Other values (3967) | 21905 |
| Value | Count | Frequency (%) |
| 9.8754 | 1 | |
| 9.9044 | 1 | |
| 9.9286 | 1 | |
| 9.9428 | 1 | |
| 9.9591 | 1 | |
| 9.9641 | 1 | |
| 9.969 | 1 | |
| 9.9759 | 1 | |
| 9.9852 | 1 | |
| 9.9854 | 1 |
| Value | Count | Frequency (%) |
| 15.081 | 1 | |
| 15.055 | 1 | |
| 15.043 | 1 | |
| 15.031 | 1 | |
| 15.002 | 1 | |
| 14.976 | 1 | |
| 14.958 | 1 | |
| 14.913 | 1 | |
| 14.908 | 1 | |
| 14.872 | 1 |
| Distinct | 18104 |
|---|---|
| Distinct (%) | 81.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.214390083 |
| Minimum | 0.00038751 |
|---|---|
| Maximum | 44.103 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 173.5 KiB |
Quantile statistics
| Minimum | 0.00038751 |
|---|---|
| 5-th percentile | 0.375205 |
| Q1 | 0.995375 |
| median | 1.5242 |
| Q3 | 2.5424 |
| 95-th percentile | 6.1675 |
| Maximum | 44.103 |
| Range | 44.10261249 |
| Interquartile range (IQR) | 1.547025 |
Descriptive statistics
| Standard deviation | 2.295746499 |
|---|---|
| Coefficient of variation (CV) | 1.036739876 |
| Kurtosis | 52.88783532 |
| Mean | 2.214390083 |
| Median Absolute Deviation (MAD) | 0.64708 |
| Skewness | 4.932390189 |
| Sum | 49139.53033 |
| Variance | 5.270451988 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.2398 | 6 | < 0.1% |
| 1.3318 | 6 | < 0.1% |
| 1.5777 | 5 | < 0.1% |
| 1.3204 | 5 | < 0.1% |
| 1.3241 | 5 | < 0.1% |
| 1.2288 | 5 | < 0.1% |
| 1.5724 | 5 | < 0.1% |
| 1.367 | 5 | < 0.1% |
| 1.4341 | 5 | < 0.1% |
| 1.0774 | 5 | < 0.1% |
| Other values (18094) | 22139 |
| Value | Count | Frequency (%) |
| 0.00038751 | 1 | |
| 0.0015935 | 1 | |
| 0.0036653 | 1 | |
| 0.0050334 | 1 | |
| 0.0061148 | 1 | |
| 0.007505 | 1 | |
| 0.0089313 | 1 | |
| 0.01096 | 1 | |
| 0.013457 | 1 | |
| 0.01644 | 1 |
| Value | Count | Frequency (%) |
| 44.103 | 1 | |
| 43.622 | 1 | |
| 43.428 | 1 | |
| 43.397 | 1 | |
| 39.05 | 1 | |
| 37.746 | 1 | |
| 35.045 | 1 | |
| 35.019 | 1 | |
| 34.496 | 1 | |
| 34.467 | 1 |
| Distinct | 16359 |
|---|---|
| Distinct (%) | 73.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68.77652873 |
| Minimum | 27.765 |
|---|---|
| Maximum | 119.91 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 173.5 KiB |
Quantile statistics
| Minimum | 27.765 |
|---|---|
| 5-th percentile | 53.694 |
| Q1 | 61.548 |
| median | 67.096 |
| Q3 | 74.572 |
| 95-th percentile | 87.1325 |
| Maximum | 119.91 |
| Range | 92.145 |
| Interquartile range (IQR) | 13.024 |
Descriptive statistics
| Standard deviation | 11.03623121 |
|---|---|
| Coefficient of variation (CV) | 0.1604650804 |
| Kurtosis | 2.470670204 |
| Mean | 68.77652873 |
| Median Absolute Deviation (MAD) | 6.364 |
| Skewness | 1.113054702 |
| Sum | 1526219.949 |
| Variance | 121.7983994 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 60.38 | 104 | 0.5% |
| 64.109 | 7 | < 0.1% |
| 66.012 | 6 | < 0.1% |
| 61.579 | 6 | < 0.1% |
| 66.696 | 6 | < 0.1% |
| 77.17 | 6 | < 0.1% |
| 66.264 | 6 | < 0.1% |
| 61.489 | 6 | < 0.1% |
| 60.295 | 6 | < 0.1% |
| 56.644 | 6 | < 0.1% |
| Other values (16349) | 22032 |
| Value | Count | Frequency (%) |
| 27.765 | 1 | |
| 41.777 | 1 | |
| 42.093 | 1 | |
| 43.198 | 1 | |
| 43.226 | 1 | |
| 43.242 | 1 | |
| 43.247 | 1 | |
| 43.274 | 1 | |
| 43.471 | 1 | |
| 43.7 | 1 |
| Value | Count | Frequency (%) |
| 119.91 | 1 | |
| 119.9 | 1 | |
| 119.89 | 1 | |
| 119.79 | 1 | |
| 119.48 | 1 | |
| 119.43 | 1 | |
| 119.41 | 1 | |
| 119.39 | 1 | |
| 119.32 | 1 | |
| 119.28 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| AT | AP | AH | AFDP | GTEP | TIT | TAT | TEY | CDP | CO | NOX | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 4.5878 | 1018.7 | 83.675 | 3.5758 | 23.979 | 1086.2 | 549.83 | 134.67 | 11.898 | 0.32663 | 81.952 |
| 1 | 4.2932 | 1018.3 | 84.235 | 3.5709 | 23.951 | 1086.1 | 550.05 | 134.67 | 11.892 | 0.44784 | 82.377 |
| 2 | 3.9045 | 1018.4 | 84.858 | 3.5828 | 23.990 | 1086.5 | 550.19 | 135.10 | 12.042 | 0.45144 | 83.776 |
| 3 | 3.7436 | 1018.3 | 85.434 | 3.5808 | 23.911 | 1086.5 | 550.17 | 135.03 | 11.990 | 0.23107 | 82.505 |
| 4 | 3.7516 | 1017.8 | 85.182 | 3.5781 | 23.917 | 1085.9 | 550.00 | 134.67 | 11.910 | 0.26747 | 82.028 |
| 5 | 3.8858 | 1017.7 | 83.946 | 3.5824 | 23.903 | 1086.0 | 549.98 | 134.67 | 11.868 | 0.23473 | 81.748 |
| 6 | 3.6697 | 1018.0 | 84.114 | 3.5804 | 23.889 | 1085.9 | 550.04 | 134.68 | 11.877 | 0.44412 | 84.592 |
| 7 | 3.5892 | 1018.2 | 83.867 | 3.5777 | 23.876 | 1086.0 | 549.88 | 134.66 | 11.893 | 0.79996 | 84.193 |
| 8 | 3.7108 | 1018.5 | 84.948 | 3.6027 | 23.957 | 1086.3 | 549.98 | 134.65 | 11.870 | 0.68996 | 83.978 |
| 9 | 4.8281 | 1018.5 | 85.346 | 3.5158 | 23.422 | 1083.1 | 549.80 | 132.67 | 11.694 | 1.02810 | 82.654 |
Last rows
| AT | AP | AH | AFDP | GTEP | TIT | TAT | TEY | CDP | CO | NOX | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 22181 | 7.1267 | 1022.4 | 85.064 | 4.6292 | 36.341 | 1100.5 | 523.17 | 170.48 | 14.791 | 1.4063 | 80.600 |
| 22182 | 7.2022 | 1023.6 | 80.694 | 4.6069 | 36.399 | 1099.6 | 522.05 | 170.56 | 14.824 | 1.4269 | 79.918 |
| 22183 | 6.3239 | 1024.7 | 88.633 | 4.6440 | 36.657 | 1099.6 | 521.08 | 172.04 | 14.867 | 1.6709 | 79.344 |
| 22184 | 5.6777 | 1025.3 | 92.704 | 4.6708 | 36.803 | 1099.8 | 521.11 | 172.54 | 14.849 | 1.5296 | 80.540 |
| 22185 | 5.4158 | 1026.1 | 82.718 | 4.6417 | 36.950 | 1100.0 | 521.10 | 172.96 | 14.811 | 1.4415 | 80.553 |
| 22186 | 4.8631 | 1027.0 | 81.084 | 4.2825 | 34.045 | 1100.0 | 529.98 | 168.38 | 14.290 | 1.2538 | 78.397 |
| 22187 | 4.5173 | 1027.4 | 80.813 | 4.2481 | 33.904 | 1100.1 | 530.47 | 168.07 | 14.344 | 1.0808 | 78.251 |
| 22188 | 4.2717 | 1027.9 | 80.380 | 4.2817 | 34.165 | 1099.9 | 529.56 | 168.55 | 14.395 | 1.0472 | 77.269 |
| 22189 | 4.0853 | 1028.6 | 78.907 | 4.2313 | 33.802 | 1100.1 | 530.61 | 167.98 | 14.343 | 1.0875 | 77.985 |
| 22190 | 4.2148 | 1029.4 | 70.679 | 4.2049 | 33.768 | 1100.0 | 530.97 | 167.30 | 14.291 | 1.1337 | 78.950 |